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ABSTRACT Telomere length, an important feature of life span control, is dependent on the activity of telomerase 
(a key enzyme of the telomere-length-maintaining system). Telomerase RNA is a component of telomerase and, 
thus, is crucial for its activity. The structures of telomerase RNA genes and their promoter regions were com- 
pared for the long-living naked mole rat and different organisms. Two rare polymorphisms in Heterocephalus 
glaber telomerase RNA (hgTER) were identified: A~>G in the first loop of pseudoknot P2b-p3 (an equivalent of 
Hint in hTR) and G^A in the scaRNA domain CR7-p8b (an equivalent of 421nt in hTR). Analysis of TER pro- 
moter regions allowed us to identify two new transcription factor binding sites. The first one is the ETS family 
site, which was found to be a conserved element for all the analyzed TER promoters. The second site is unique for 
the promoter region of TER of the naked mole rat and is a binding site for the SOX17 transcription factor. The 
absence of one Spl site in the TER promoter region of the naked small rat is an additional specific feature of the 
promoter area of hgTER. Such variation in the hgTER transcription regulation region and hgTER itself could 
provide increased telomerase activity in stem cells and an extended lifespan to H. glaber. 
KEYWORDS H. glaber; telomerase RNA; bioinformatics; promoter analysis; comparative genomics. 
ABBREVIATIONS TER - telomerase RNA; hTR - human telomerase RNA; TERT - telomerase catalytic subunit; 
hTERT - human telomerase catalytic subunit; hgTER - Heterocephalus glaber telomerase RNA. 



INTRODUCTION 

Several studies have shown that stem cell function is 
impaired during aging [1 _ 3]. A decrease in stem cell 
function may contribute to impaired maintenance and 
the function of some tissue during aging. The role of 
telomere shortening was identified to be the mecha- 
nism contributing to the accumulation of DNA dam- 
age in replicative aging of stem cells [4]. Experiments 
in mTER— /— mice with telomere shortening have pro- 
vided the experimental evidence that this process can 
impair the function of somatic and germline stem cells 
[5]. Telomerase is a key component of the telomere- 
length-maintaining system and an important contribu- 
tor to the reduction of replicative senescence in germ 
and stem cells [6, 7]. Temporary telomerase reactivation 
in late-generation TERT-deficient mice extends tel- 
omeres, reduces DNA damage signaling and associated 



cellular checkpoint responses, and eliminates degen- 
erative phenotypes across multiple organs, including 
testes, spleen, intestine, and even neurons [7]. Moreo- 
ver, temporary telomerase expression in aged normal 
mice significantly increases the lifespan of mice [8]. 

Telomerase synthesizes new telomere repeats at the 
G-strand and thus participates in the compensation for 
telomere loss during replication [9]. Two components 
are required for telomerase activity in vitro: a reverse 
transcriptase catalytic subunit (TERT) and telomer- 
ase RNA (TER) that contains a template for telomere 
synthesis [10]. TERT was shown to play a role in a 
number of cellular processes (cell cycle response, oxi- 
dative stress, antiapoptotic action, etc. [11]) outside of 
the telomerase complex. No hTERT was found in most 
differentiated normal tissues, although a low level of 
hTERT could be detected in the skin, spleen, stomach 
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and small intestine and a higher level was detected in 
testes and the endometrium [12]. In contrast, hTR ex- 
pression was detected in many normal tissues, includ- 
ing testes, ovary, brain, liver, small intestine, thymus, 
kidney, and prostate, suggesting that telomerase RNA 
may also have alternative functions [13]. In some can- 
cer cell lines, the level of TERT expression is critical 
for telomere elongation; however, in case of stem cells 
in a living organism it is the high level of TER expres- 
sion that is more important for telomere elongation. In- 
deed, an analysis of interspecies crosses of TER- and 
TERT-deficient mice [14] showed that the increase in 
the gene copy number of TER, but not TERT, is what 
is critical in telomere elongation. Ectopic expression of 
hTER caused telomere elongation in bovine blastocysts, 
whereas co-expression of hTERT and hTER did not 
result in further increase in the telomere length [15], 
providing further evidence of the fact that the level of 
telomerase RNA is critical for telomerase activity and 
telomere elongation in the cell within the organism. 

The genome and transcriptome of the naked mole 
rat have recently been sequenced [16, 17]. Hetero- 
cephalus glaber, the naked mole rat (H. glaber), has a 
very high life expectancy among rodents (> 28 years 
vs 1.5-7 years in other rodents), high resistance to car- 
cinogenesis and retarded aging [18]. 

A comparative study of the H. glaber genome can 
help reveal the reasons for the surprisingly long lifespan 
of this animal. A number of genetic alterations have al- 
ready been found, which could explain the increase in 
the DNA repair level, as well as the reduced oxidative 
damage or reduced replicative senescence [5, 19]. An- 
other reason for the longevity could be the higher level 
of telomerase activity in H. glaber stem cells or telomer- 
ase reactivation under a certain type of stimulus. In this 
study, we have compared the structure of telomerase 
RNA genes and their promoter regions for the long- 
living naked mole rat and different organisms with an 
aim to identify the features that may increase hgTER 
expression and telomerase activity in stem cells. 

MATERIAL AND METHODS 

Comparison of TER sequences 

To search for the hgTER gene of H. glaber, we used 
the whole genome shotgun project entries with IDs: 
AFSB (GenBank: AFSB00000000.1) and AHKG (Gen- 
Bank: AHKG00000000.1). We used ClustalW to build an 
alignment of the sequences of interest with the already 
well-described telomerase RNA genes. The following 
reference sequences were used: Cavia porcellus TER 
(GenBank: AF221929.1), Cavia porcellus WGS assem- 
bly (GenBank: AAKN00000000.2), Chinchilla chinchil- 
la TER (GenBank: AF221937.1), Chinchilla chinchilla 



WGS assembly (GenBank: AGCD00000000.1), Mus 
musculus TER (GenBank: NR_001579.1), Mus mus- 
culus chromosome 3 (GenBank: NC_000069.6), Rat- 
tus norvegicus TER (GenBank: NR_001567.1), Rattus 
norvegicus chromosome 2 (GenBank: NC_005101.3), 
human TER (GenBank: NR_001566.1), human chro- 
mosome 3 (GenBank: NC_000003.11), Danio rerio TER 
(GenBank: EF569636.1), and chromosome 25 (GenBank: 
CU651628.3). 

The alignment analysis and the influence of poly- 
morphisms on the secondary structure were performed 
manually. The sequences of TERs from Suncus murinus 
(GenBank: AF221921), Geomys breviceps (GenBank: 
AF221930), Microtus ochrogaster (GenBank: AF221909), 
Mus spretus (GenBank: AY058901), Mus musculus 
(GenBank: AY058900), Dasyurus hallucatus (GenBank: 
AF221919), Bufo japonicus (GenBank: AF221913) and 
Typhlonectes natans (GenBank: AF221910) were used to 
perform a secondary structure analysis. 

Comparison of TER promoter areas 
We searched for promoter regions using the Jaspar da- 
tabase [20], restricting the search to the Jaspar CORE 
Vertebrate with a 99-100% relative profile threshold. 
ConSite with a 85-95% TF score cutoff was used for 
further analysis of the promoter sequences [21]. Rela- 
tive scores were used as normalized score values for the 
quantitative evaluation of hit significances [22]. Hit cor- 
rections were done manually where necessary. Visuali- 
zation of multiple alignments was corrected manually. 

RESULTS 

Identification of H. glaber TER 

The full hgTERC gene (Heterocefalus glaber TER) was 
identified by local BLAST on the basis of the H. gla- 
ber genome assembly (WGS record AHKG) and a com- 
parison with multiple alignment data for mammalian 
TER sequences [23]. The final alignment is available at 
93.180.62.254/hgTERC/ESM_l.pdf. 

According to phylogenetic data, the closest relatives 
of H. glaber are Hydrochoerus hydrochaeris, Cavia por- 
cellus, Chinchilla chinchilla, and Myocastor coypus [18]. 
TER sequences are known only for Chinchilla chinchil- 
la (GenBank: AF221937.1) and Cavia porcellus (Gen- 
Bank: AF221929.1); those were used for further struc- 
ture comparison. Furthermore, the data for human 
TER were used for the analysis due to the availability 
of detailed information about the promoter region and 
secondary structure of TER. 

Comparison of TER promoter areas 

A 500 nt (from the expected transcription start site) 

promoter region was used for the analysis, since the 
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major regulatory elements were found in this area for 
human TER (hTER) [24]. The web services JASPAR 
[20] and ConSite [21] were used to search for transcrip- 
tion factor binding sites in the promoter area of H. gla- 
ber. These tools provide the possibility to analyze any 
sequence data; they are relatively simple in operation 
and allow one to deal with large-position weight matri- 
ces with optimal results due to the effective job filter- 
ing. We used strict filtering of the results for both tools 
to find the most reliable sites. 

This approach reduced the number of predicted 
transcription factor binding sites in the hTER promoter 
area compared to the number of sites determined ear- 
lier [24]. The elements in the promoter region of TERs 
from four different species identified in this study are 
shown at 93.180.62.254/hgTERC/ESM_2.pdf (the full 
promoter map is available at 93.180.62.254/hgTERC/ 
ESM_3_old.pdf). The elements found in the promoter 
region of hgTER are as follows: TATA box in the prox- 
imity of the transcription start site, NF-Y site with the 
conserved CCAAT box, three SP1 sites, ELK4 site, and 
SOX 17 binding site. 

The ELK4 transcription factor binding site had 
never been identified for any TER before; it is located 
approximately 170 nt upstream from the start of the 
TER coding region. We have found this promoter ele- 
ment for all tested sequences; p values were calculated 
by MAST. The ELK4 score for the H. glaber was 14.059 
(the relative score 0.9999; p = 3.3-10' 6 ); 11.056 (0.9034, 
4.1-10" 5 ) for Cavia porcellus; 12.053 (0.9311, 2.2-10 5 ) for 
Chinchilla chinchilla; and 10.398 (0.9396, 5.5-10 5 ) for 
humans, thus meaning a very high probability (> 90%) 
of ELK4 transcription factor binding site occurrence in 
the TER promoter area, at least from the bioinformatic 
point of view. 

When performing the search, we found that the 
ELK4 binding site [25] matrix in the JASPAR database 
was outdated [25-27] and this led to a false identifica- 
tion for most proteins containing the ETS domain, ex- 
cept for families I and II [28]. Since we are dealing with 
a very small set of sequences and the difference be- 
tween the new and old matrices is negligible, we used 
the old position weight matrix (PWM). 

Multiple alignments showed that a ETS binding site 
was present in all four species: human, Cavia porcellus, 
H. glaber, and Chinchilla chinchilla. The binding sites 
found have variations in positions 1, 8, and 9 of PWM, 
which is consistent with the known ETS binding sites 
[26, 27, 29]. Thus, we suggest that the identified site is a 
regulatory element for the hgTER promoter. 

The SOX 17 binding site was detected solely in the H. 
glaber TER promoter region; it is located ~ 430 nt up- 
stream from the transcription start site. The fact that 
the SOX 17 site is present in H. glaber but not in other 



evolutionary related animals may be an indication of an 
important difference between these species. 

The SOX17 protein contains an HMG_box (Pfam 
domain high-mobility group box) responsible for the 
high-affinity binding to non-B-type DNA conforma- 
tions (kinked or unwound) [30]. The characteristic 
binding motif in DNA is almost identical for the entire 
HMG_box family [31, 32]; however, for the transcrip- 
tion factor SOX17 [33, 34] harboring the Sox_C_TAD 
domain (Pfam: PF12067), the binding sites are largely 
different from the canonical site - AACAAT [32, 35]. 

Based on the common architecture for most TERs, we 
assumed that hgTER also contains the known second- 
ary structure elements [23]. We mainly focused on the 
rare TER polymorphisms in the important functional 
elements of the telomerase RNA secondary structure. 

The comparison of hgTER with the TER of H. glab- 
er' s closest relatives (guinea pig and chinchilla) and the 
TER of model organisms (rat, mouse, and humans) re- 
vealed a number of differences. Although most chang- 
es in H. glaber TER were not unique (present in other 
mammalians) and did not affect the functional elements 
of TER, we managed to find two rare polymorphisms in 
the functional region of hgTER. The mapped polymor- 
phisms are shown in Figs, la and lb. 

The first polymorphism was a A~>G replacement in 
the first loop of pseudoknot P2b-p3 at position 111 (ac- 
cording to the hTER nomenclature). This substitution 
gives rise to the non-canonical "G— U" pair in the pseu- 
doknot region. Most other TERs have a canonical pair 
at this position. Moreover, for TERs that have polymor- 
phism A-'Glll additional replacement \J->Cn9 takes 
place that restored the canonical WC pair between 
bases in 111 and 179 positions (e.g., in Geomys breviceps 
and Microtus ochrogaster) [23] The only example of the 
same G— U pair was revealed in TERs of the Dasyurus 
hallucatus and Suncus murinus [23]. 

The second polymorphism is the replacement G~>A 
in the stem of the scaRNA domain CR7-p8b at position 
421 (according to the hTER nomenclature). This substi- 
tution gives rise to the non-canonical "C _ A" pair in the 
p8b stem terminus element at position 421. Most TERs 
have a canonical base pair at this position. Amphibians 
(toads and typhlonectes) have G421-»A replacement 
accompanied by C^U transition at position 408, which 
restores the WC pair [23]. Close relatives - rodents 
(chinchilla, guinea pig, mice)-have polymorphisms that 
cause a disturbance in the p8b stem at different posi- 
tions [23], but this particular substitution at position 421 
with the non-canonical pair is unique to H. glaber. 

DISCUSSION 

Telomerase RNA is a crucial telomerase component, 
and increased expression of telomerase RNA and te- 



VOL. 6 JYo2(21) 2014 | ACTA NATURAE | 43 



RESEARCH ARTICLES 



A 

a aaa l gucagcg 
ugcgaciMi!) cgg^jcg 

GCGC^G 



u 



.u 



u 



H. glaber 



r —ire 
A 



120 

I 



180 

a aaa l gucagcg 



GGCGAClJlljL' cagijcg 
GCGClLlG 110 



ioT U U U 



.u 



H. sapiens 



"AAA^UCAGCC 

GGCGAClW CGG^ICG 

UCGC^IG G 
U r U 

u uuU G 

D. hallucatus 



A 

A AAA L GUCAGCG 
UGCGAClWl!) ca'gl^cg 
gcgci!jg c 



u 



u 



u 



u u u 

C/i. chinchilla 





410 — 



„420 




C. porcellus 




H. sapiens 



u' ®- 
Ch. chinchilla 




B. japonicus 
T. natans 




M. musculus 



Fig. 1 . The P2b-p3 and CR7-p8b areas of pseudoknot with the mapped polymorphisms for different species. The 
polymorphism is marked with a circle; The Watson-Crick base paring is marked with a solid line; non-canonical base 
pairing is marked with a dashed line. A - the P2b-p3 areas of pseudoknot for H. sapiens, Ch. chinchilla, H. glaber 
and D. Hallucatus. B — the CR7-p8b area of the scaRNA TER domain with the mapped polymorphism for H. sapiens, 
Ch. chinchilla, C. porcellus, H. glaber, M. Musculus, Bombina japonic), and Typhlonectes natans 



lomerase activity in stem cells or other tissues at dif- 
ferent stages of an animal's development could be an 
essential reason for its long lifespan. Comparison of the 
TER gene promoter region and TER in H. glaber with 
available data on other species allowed us to reveal 
variations both in the promoter region and telomerase 
RNA structure. 

An analysis of the region 500 nt upstream of hgTER 
transcription start side allowed us to identify the reg- 
ulatory elements known for other organisms [24] and 
two new ones: the ETS site, which is present in all four 
model organisms, and the SOX17 site presented only 
in H. glaber (Fig. 2). All common elements are located 
within the ~ 270 nt area, in agreement with the DNase 
1 protection data for the human regulatory region [36]. 
This region contains the TATA box in the proximity 
of the transcription start site, a NF-Y site with a con- 
served CCA AT box, SP1 sites, and the newly identi- 
fied ETS site. In humans, four Spl (Spl.l, Spl.2, Spl.3 
and Spl. 4) sites were previously identified. An analysis 
of H. glaber and its close relatives revealed that one or 
more Spl sites can be missing in a particular organism. 
For example, the Spl.2 site is missing in all rodents {Fig. 
2) and the Spl.3 site is also absent in Cavia porcellus. 



In case of humans, the two transcription factors Spl 
and Sp3 can bind to the Sp-sites within the promoter. 
Spl stimulates expression, while Sp3 induces dose- 
dependent repression [36]. The sites adjacent to the 
CCAAT box from either side (Spl.l only for H. glaber) 
are thought to cooperate with NF-Y to mediate posi- 
tive or negative regulatory effects in humans [37]. The 
Spl.3 and Spl.4 sites adjacent to the transcription start 
site could also regulate transcription, either positively 
or negatively, depending on the presence of other pro- 
teins that interact with transcription factors [38]. The 
context of a particular Spl site was suggested as es- 
sential for the preferential binding of either Spl or Sp3 
factors, which might influence the TER transcription 
regulation [38]. Thus, the absence of the Spl.2 site in ro- 
dents may result in differences in the fine regulation of 
TER transcription via the Sp pathway in rodents, mak- 
ing it more dependent on the particular context of the 
remaining Sp sites. In case of H. glaber, this may have a 
positive effect on the TER transcription efficiency. 

The newly identified ELK4 site is located within the 
272 nt area further downstream from the transcription 
start site. It was found in all the studied species, includ- 
ing humans. ELK4 is a member of the ETS family of 
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Fig. 2. Schematic annotation of the TER promoter areas for humans, guinea pig, mole rat, and chinchilla. The reference 
promoter elements are marked as gray rectangles, and the putative elements predicted in this study are marked as 
hatched rectangles 



transcription factors [29]. For H. glaber, the sequence 
of this site is identical to that of ELK4, but it can also 
be used by the other members of the ETS family [28]. 
This factor was identified as a novel target for the an- 
drogen receptor-activating cascade. The fact that an- 
drogen signaling blockade in the case of prostate can- 
cer reduced telomerase activity indirectly proves that 
ELK4 participates in the regulation of TER transcrip- 
tion [25]. 

A SOX17 binding site was found only in H. glaber. It 
is located approximately 430 nt upstream of the gene 
region, and thus outside of the 272 nt promoter area, 
which was previously shown to be important for hTER 
transcription. SOX17 belongs to the family of HMG- 
like SOX proteins. The SOX 17 binding site (AC A AT) 
is identical for the other members of the SOX pro- 
teins, and binding of a particular factor depends on the 
broader context around the conventional site. SOX 17 
(SRY-box 17) is a transcription factor involved in the 
regulation of several developmental processes [39, 40], 
including endoderm formation, vascular development, 
and fetal hematopoietic stem cell maintenance. 

Soxl7 is highly restricted in its expression within the 
hematopoietic system to fetal hematopoietic stem cells 
(HSCs) [41]. It has recently been shown that Soxl7 ex- 
pression confers self-renewal potential and fetal stem 
cell characteristics to adult hematopoietic progenitors 
[42]. Other SOX proteins are involved in the regulation 
of various cellular processes. A lack of data does not al- 
low one to propose a particular regulation pathway for 
the SOX binding site, but the presence of this site is an 
additional possibility for H. glaber to regulate the ex- 
pression of telomerase RNA and to increase the level of 
telomerase activity, especially in fetal stem cells. This 



correlates with earlier studies, where long-living ro- 
dent species (such as the H. glaber and Sciurus caro- 
linensis ) have a higher telomerase activity than mice 
[43]. 

Telomerase activity depends not only on the tran- 
scriptional level of telomerase components; many oth- 
er processes are involved, including TER maturation, 
transport, telomerase assembly, interaction between 
TER and TERT, etc. Mutations in telomerase RNA can 
influence these processes. 

We found two rare polymorphisms in hgTER: 
All l^G and G421^A. The All l^G transition in hgTER 
is located in the stem loop of P2b-p3 pseudoknot. The 
P2b-P3 pseudoknot {Fig. la) is highly conserved [23]. 
The effect of this mutation on the function of telomer- 
ase is unknown, but mutations destabilizing the pseu- 
doknot structure affect telomerase activity and lead 
to aplastic anemia, myelodysplasia, and leukemia in 
humans [44]. Moreover, mutations that destabilize the 
pseudoknot structure reduce telomerase activity [45] 
and lead to dyskeratosis congenita [46]. Polymorphism 
in this position in other organisms is accompanied by 
the second mutation that restores the canonical pair. 
Due to the A-*G replacement in H. glaber, a non-canoni- 
cal G-TJ pair is formed. In contrast to other non-canon- 
ical pairs, G-TJ causes very little distortion to the RNA 
helix structure [47] and should not have such a severe 
effect on telomerase as the other ones. The G-TJ pair 
in this position is found only in the Asian house shrew 
and the northern quoll. Life expectancy of the northern 
quoll is 7 years and about 3 years for shrews [48]. Thus, 
there is no evident correlation between the existence of 
the G-TJ pair in a particular position in the pseudoknot 
structure and life expectancy. 
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The polymorphism G~»A at position 421 leads to the 
formation of a C~A non-canonical pair in the p8b stem. 
Most mammals have the canonical base pair at this po- 
sition. It should be mentioned that disruption (C->G in 
C~G) of 408-421 base pairs in humans leads to dysk- 
eratosis congenita [46]. Rodents (chinchilla, guinea pig, 
mice) have polymorphisms that cause distortion of the 
p8b stem. The G~>A transition in H. glaber belongs to 
the same class of species-dependent variations, but this 
particular substitution is unique to H. glaber. P8b is a 
part of the CR7-p8b (H/ACA) domain. CR7 is required 
for 3 '-end processing, localization, and the stability of 
hTER [49]. CR7 contains a conserved Cajal body locali- 
zation element (CAB box) [50]. The telomerase Cajal 
body protein 1 (TCAB1) binds to the CAB box [51] and 
drives hTER to the Cajal body. TCAB1 knockdown 
prevents telomerase-telomere association and results 
in telomere shortening [52]. For H. glaber, the non-ca- 
nonical pair "C-A" in the p8b stem loop can improve 
the interaction between hgTCABl and hgTER, make 



telomerase traffic more effective and thus result in a 
more efficient telomere elongation. • 

CONCLUSIONS 

Comparison of hgTER and other telomerase RNA 
genes suggests that both the unique structure of the 
promoter region and the specific polymorphisms in 
the functional domains can cause increased expres- 
sion of the telomerase RNA gene in stem cells, thus 
reducing replicative senescence and increasing the 
lifespan. We hope that our finding of a difference in 
the promoter region of telomerase RNA will inspire 
other researchers to study these processes using in 
vivo mouse models. 
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